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PATENT APPLICATION 
DOCKET NO. 2875.1001-007 



Date: // Ll(0<^^ Express Mail Label No. t:U65^5'ni^Wio US 



Inventors: C. Thomas Caskey, John Shumaker and Andres Metspalu 

5 Attorney's Docket No.: 2875.1001-007 

PARALLEL PRIMER EXTENSION APPROACH 
TO NUCLEIC ACID SEQUENCE ANALYSIS 

RELATED APPLICATIONS 

This application is a Continuation of United States Application No. 08/564,100, 
1 0 which is the United States National Phase Apphcation of PCT/US94/07086, filed June 
22, 1994 which is a Continuation-in-Part Application of Sweden Application No. SE 
9302152-5, filed on June 22, 1993, the entire teachings of which are incorporated herein 
by reference in their entireties. 

GOVERNMENT FUNDING 
15 This invention was made with Government Support under grant number 5-ROl- 

DK3 1 428-1 1 awarded by the National Institutes of Health. The United States 
Government has certain rights in the invention. 

BACKGROUND OF THE INVENTION 

Today, there are two predominant methods for DNA sequence determination: 

2 0 the chemical degradation method (Maxam and Gilbert, Proc. Natl. Acad. Sci, 74:560- 
564 (1977), and the dideoxy chain termination method (Sanger et al, Proc. Natl. Acad. 
Sci. 74:5463-5467 (1977)). Most automated sequencers are based on the chain 
termination method utilizing fluorescent detection of product formation. There are two 
conmion variations of these systems: (1) dye-labeled primers to which 

2 5 deoxynucleotides and dideoxynucleotides are added, and (2) primers to which 

deoxynucleotides and fluorescently labeled dideoxynucleotides are added. Li addition, 
the labeled deoxynucleotides can be used in conjunction with unlabeled 
dideoxynucleotides. This method is based upon the ability of an enzyme to add specific 
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nucleotides onto the 3' hydroxyl end of a primer annealed to a template. The base 
pairing property of nucleic acids determines the specificity of nucleotide addition. The 
extension products are separated electrophoretically on a polyacrylamide gel and 
detected by an optical system utilizing laser excitation. 
5 Although both the chemical degradation method and the dideoxy chain 

termination method are in widespread use, there are many associated disadvantages: for 
example, the methods require gel-electrophoretic separation. Typically, only 400-800 
base pairs can be sequenced from a single clone. As a result, the systems are both time- 
and labor-intensive. Methods avoiding gel separation have been developed in attempts 

10 to increase the sequencing throughput. 

Methods have been proposed by Crkvenjakov (Drmanac, et al., Genomics 4:1 14 
(1989); Strezoska et al., (Proc. Natl. Acad.Sci. USA 88:10089 (1991); Drmanac, et al, 
Science 260: 1649 (1991)) and Bains and Smith (Bains and Smith, J. Theoretical Biol. 
135: 303 (1988)). These sequencing by hybridization (SBH) methods potentially can 

1 5 increase the sequence throughput because multiple hybridization reactions are 

performed simultaneously. This type of system utilizes the information obtained from 
multiple hybridizations of the polynucleotide of interest, using short ohgonucleotides to 
determine the nucleic acid sequence (Dmianac, United States Patent No. 5,202,231). To 
reconstruct the sequence requires an extensive computer search algorithm to determine 

2 0 the optimal order of all fragments obtained from the multiple hybridizations. 

These methods are problematic in several respects. For example, the 
hybridization is dependent upon the sequence composition of the duplex of the 
oligonucleotide and the polynucleotide of interest, so that GC-rich regions are more 
stable than AT-rich regions. As a result, false positives and false negatives during 

2 5 hybridization detection are frequently present and complicate sequence determination. 

Furthermore, the sequence of the polynucleotide is not determined directly, but is 
inferred from the sequence of the known probe, which increases the possibility for error. 
A great need remains to develop efficient and accurate methods for nucleic acid 
sequence determination. 

3 0 SUMMARY OF THE INVENTION 

The current invention pertains to methods for analyzing, and particularly for 
sequencing, a polynucleotide of interest, and an apparatus useftil in analyzing a 
polynucleotide of interest. In one embodiment of the current invention, the nucleotide 
sequence of a polypeptide of interest is analyzed for the presence of mutations or 
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alterations. In a second embodiment of the current invention^ the nucleotide sequence 
of a polypeptide of interest, for which the nucleotide sequence was not known 
previously, is determined. The method comprises detecting single base extension events 
of a set of specific ohgonucleotide primers, such that the label and position of each 
5 separate extension event defines a base in a polynucleotide of interest. 

hi one method of the current invention, a solid support is provided. An array of 
a set or several sets of consecutive oligonucleotide primers of a specified size having 
known sequences is attached at defined locations to the solid support. The 
oligonucleotide primers differ within each set by one base pair. The oligonucleotide 

1 0 primers either correspond to at least a part of the nucleotide sequence of one strand of 
the polynucleotide of interest, if the sequence is known, or represent a set of all possible 
nucleotide sequences for ohgonucleotide primers of the specified size, if the sequence is 
not known. A polynucleotide of interest, which may be DNA or RNA, or a fi-agment of 
the polynucleotide of interest, is annealed to the array of oligonucleotide primers under 

15 hybridization conditions, thereby generating "annealed primers". The annealed primers 
are subjected to single base extension reaction conditions, under which a nucleic acid 
polymerase and terminating nucleotides, such as dideoxynucleotides (ddNTPs) 
corresponding to the four known bases (A, G, T and C), are provided to the annealed 
primers. The terminating nucleotides can also comprise a terminating string of known 

2 0 polynucleotides, such as dinucleotides. As a result of the single base extension reaction, 
extended primers are generated, in which a terminating nucleotide is added to each of 
the annealed primers. The terminating nucleotides can be provided to the annealed 
primers either simultaneously or sequentially. The terminating nucleotides are mutually 
distinguishable; i.e., at least one of the nucleotides is labelled to faciHtate detection. 

2 5 After addition of the terminating nucleotides, the sequence of the polynucleotide of 

interest is analyzed by "reading" the oligonucleotide array: the identity and location of 
each terminating nucleotide within the array on the solid support is observed. The label 
and position of each terminating nucleotide on the solid support directly defines the 
sequence of the polynucleotide of interest that is being analyzed. 

3 0 Li a second method of the current invention, the polynucleotide of interest is 

analyzed for the presence of specific mutations through the use of oligonucleotide 
primers that are not attached to a sohd support. The oligonucleotide primers are tailored 
to anneal to the polynucleotide of interest at a point immediately preceding the mutation 
site(s). If more than one mutation site is examined, the oligonucleotide primers are 
3 5 designed to be mutually distinguishable: in a preferred embodiment, the 
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oligonucleotide primers have different mobilities during gel electrophoresis. For 
example, ohgonucleotides of different lengths are used. After the oligonucleotide 
primers are annealed to the polynucleotide of interest, the annealed primers are 
subjected to single base extension reaction conditions, resulting in extended primers in 
5 which terminating nucleotides are added to each of the annealed primers. As in the first 
method of the current invention, the terminating nucleotides are mutually 
distinguishable. After addition of the terminating nucleotides, the sequence of the 
polynucleotide of interest is analyzed by eluting the extended primers, performing gel 
electrophoresis, and "reading" the gel: the identity and location of each terminating 

1 0 nucleotides on the gel is observed using standard methods, such as with an automated 
DNA sequencer. The label and position of each terminating nucleotide on the gel 
directly defines the sequence of the polynucleotide of interest that is being analyzed, and 
indicates whether a mutation is present. 

The apparatus of the current invention comprises a solid support having an array 

15 of one or more sets of consecutive oHgonucleotide primers with known sequences 

attached to it at defined locations, each oligonucleotide primer differing within each set 
by one base pair. The set of oHgonucleotide primers either corresponds to at least a part 
of the nucleotide sequence of one strand of the polynucleotide of interest, if the 
sequence is known, or represents all possible nucleotide sequences for oligonucleotide 

2 0 primers of the specified size, if the sequence is not known. 

The current invention provides both direct information, due to the detection of a 
specific nucleotide addition, and indirect information, due to the known sequence of the 
annealed primer to which the specific base addition occurred, for the polynucleotide of 
interest. The ability to determine nucleic acid sequences is a critical element of 

2 5 understanding gene expression and regulation. In addition, as advances in molecular 

medicine continue, sequence determination will become a more important element in 
the diagnosis and treatment of disease, 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts an example of a set of oligonucleotide primers (sense primers, 

3 0 SEQ ID NOS: 2-11, antisense primers, SEQ ID NOS: 12-21), comprising consecutive 

primers differing by one base pair at the growing end and capable of hybridizing 
successively along the relevant part(s) or the whole of the polynucleotide of interest 
(SEQ ID NOS: 1 and 22). 
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Figure 2 is a schematic illustration of a single strand template bound to a primer 
which is in turn attached to a solid support. 

Figure 3 illustrates a set of consecutive oligonucleotide primers for a part of the 
polynucleotide of interest following immediately after the primer illustrated in Figure 2. 
5 Figure 4 illustrates the single base pair additions to all the primers illustrated in 

Figure 3, as well as the corresponding additions for the corresponding primers related to 
the complementary strand of the polynucleotide of interest. 

Figure 5 is a graphic depiction of the length of extended primers formed utilizing 
free oligonucleotide primers annealed to a polynucleotide of interest. 
1 0 Figures 6A, 6B and 6C are graphic depictions of electrophoretograms 

demonstrating the detection of the presence of a mutation in a polynucleotide of interest. 

Figures 7A, 7B and 7C depict the results of a DNA chip-based analysis for a 
five-base region within the third exon of the HPRT gene. 

DETAILED DESCRIPTION OF THE INVENTION 

15 The current invention pertains to methods for analyzing the nucleotide sequence 

of a polynucleotide of interest. The method comprises hybridizing all or a fragment of a 
polynucleotide of interest to ohgonucleotide primers, conducting single base extension 
reactions, and detecting the single base extension events. The method can be used to 
analyze the sequence of a polypeptide of interest by examining the sequence for the 

2 0 presence of mutations or alterations in the nucleotide sequence, or by determining the 
nucleotide sequence of a polypeptide of interest. 

As used herein, the term "polynucleotide of interest" refers to the particular 
polynucleotide for which sequence information is wanted. Representative 
polynucleotides of interest include oligonucleotides, DNA or DNA fragments, RNA or 

2 5 RNA fragments, as well as genes or portions of genes. The polynucleotide of interest 

can be single- or double-stranded. The term "template polynucleotide of interest" is 
used herein to refer to the strand which is analyzed, if only one strand of a double- 
stranded polynucleotide is analyzed, or to the strand which is identified as the first 
strand, if both strands of a double- stranded polynucleotide are analyzed. The term, 

3 0 "complementary polynucleotide of interest" is used herein to refer to the strand which is 

not analyzed, if only one strand of a double-stranded polynucleotide is analyzed, or to 
the strand which is identified as the second strand (i.e., the strand that is complementary 
to the first (template) strand), if both strands of a double-stranded polynucleotide are 
analyzed. Either one of the two strands can be analyzed. In a preferred embodiment. 
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both strands of a double-stranded polynucleotide of interest are analyzed in order to 
verify sequence information obtained from the template (first) strand by comparison 
with the complementary (second) strand. Nevertheless, it is not always necessary to 
analyze both strands. For example, if the polynucleotide of interest is being analyzed 
5 for the presence of a single base mutation, and not for the complete base sequence in the 
mutation region, it is sufficient to analyze a single strand of the polynucleotide of 
interest. 

The methods of the current invention can be used to identify the presence of 
mutations or alterations in the nucleotide sequence of a polypeptide of interest. To 

1 0 identify mutations or alterations, the sequence of the polynucleotide of interest is 

compared with the sequence of the native or normal polynucleotide. An "alteration" in 
the polynucleotide of interest, as used herein, refers to a deviation from the expected 
sequence (the sequence of the native or normal polynucleotide), including deletions, 
insertions, point mutations, frame-shifts, expanded oligonucleotide repeats, or other 

1 5 changes. The portion of the polynucleotide of interest that contains the alteration is 
known as the "altered" region. The methods can also be used to determine the 
polynucleotide sequence of a polypeptide of interest having a previously unknown 
nucleotide sequence. 

In one embodiment of the current invention, the polynucleotide of interest is 

2 0 analyzed by annealing the polynucleotide to an array comprising sets of oligonucleotide 
primers. The oligonucleotide primers in the array have a length N, where N is from 
about 7 to about 30 nucleotides, inclusive, and is preferably from 20 to 24 nucleotides, 
inclusive. Each oUgonucleotide primer within each set differs by one base pair. The 
oligonucleotide primers can be prepared by conventional methods (see Sambrook et al., 

2 5 Molecular Cloning: A Laboratory Manual (2nd Ed, 1989)). The sets of 

oligonucleotide primers are arranged into an array, such that the position and nucleotide 
content of each oligonucleotide primer on the array is known. 

The size and nucleotide content of the oligonucleotide primers in the array 
depend on the polynucleotide of interest and the region of the polynucleotide of interest 

3 0 for which sequence information is desired. To analyze a polynucleotide of interest for 

the presence of alterations, consecutive primers differing by one base pair at the 
growing end and capable of hybridizing successively along the relevant part(s) or the 
whole of the polynucleotide are used. An example of such a primer set is shown in 
Figure 1. If only one or a few specific positions of the polynucleotide sequence are 
3 5 examined for alterations, the necessary array of oligonucleotide primers covers only the 
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mutation regions, and is therefore small. If the whole or a major part of the 
polynucleotide of interest is to be analyzed for possible mutations at varying positions, 
the necessary array is larger. For example, the whole hypoxanthine-guanine 
phosphoribosyl-transferase (HPRT) gene can be covered by 900 primers, arranged in a 
5 30 X 30 array; the whole p53 gene requires 700 primers. If both strands of a double- 
stranded polynucleotide of interest are analyzed for the presence of alterations, the array 
comprises consecutive ohgonucleotide primers for the suspected mutation region of 
both the template polynucleotide of interest and the complementary polynucleotide of 
interest. If the polynucleotide of interest has not been sequenced previously, the array 

1 0 includes oligonucleotide primers comprising all possible N-mers. 

The array of sets of ohgonucleotide primers is immobilized to a sohd support at 
defined locations (i.e., known positions). The immobilized array is referred to as a 
"DNA chip", which is the apparatus of the current invention. The solid support can be a 
plate or chip of glass, silicon, or other material. The solid support can also be coated, 

1 5 such as with gold or silver. Coating may facilitate attachment of the oligonucleotide 
primers to the surface of the solid support. The oligonucleotide primers can be bound to 
the solid support by a specific binding pair, such as biotin and avidin or biotin and 
streptavidin. For example, the primers can be provided with biotin handles in 
connection with their preparation, and then the biotin-labelled primers can be attached 

2 0 to a streptavidin-coated support. Alternatively, the primers can be bound by a linker 

arm, such as a covalently bonded hydrocarbon chain, such as a C 10.20 chain. The primers 
can also be bound directly to the solid support, such as by epoxide/amine coupling 
chemistry (see Eggers, M.D. et al., Advances in DNA sequencing Technology, SPIE 
conference proceedings, January 21, 1993), The solid support can be reused, as 

2 5 described in greater detail below. 

In another embodiment of the invention, the polynucleotide of interest is 
analyzed by annealing the polynucleotide to one or more specific ohgonucleotide 
primers that are not attached to a solid support; such oligonucleotide primers are 
referred to herein as "free ohgonucleotide primers". If free ohgonucleotide primers are 

3 0 used, the polynucleotide of interest can be attached to a solid support, such as magnetic 

beads. The free oligonucleotide primers have a length N, as described above, and are 
prepared by conventional methods (see Sambrook et al, Molecular Cloning: A 
Laboratory Manual (2nd Ed, 1989)). The size and nucleotide content of the free 
oligonucleotide primers depend on the polynucleotide of interest and the region of the 
3 5 polynucleotide of interest for which sequence information is desired. To analyze a 
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polynucleotide of interest for the presence of alterations, primers capable of hybridizing 
inunediately adjacent to the relevant part(s) of the polynucleotide are used. If more than 
one position of the polynucleotide sequence is examined for alterations, the free 
oligonucleotide primers are mutually distinguishable: i.e., the oligonucleotide primers 
5 have different mobilities during gel electrophoresis. In a preferred embodiment, 

ohgonucleotides of different lengths are used. For example, an ohgonucleotide primer 
of 10 nucleotides in length is designed to hybridize immediately adjacent to one putative 
mutation, and an ohgonucleotide primer of 12 nucleotides in length is designed to 
hybridize immediately adjacent to a second putative mutation. Because the 
1 0 oligonucleotide primers are of different lengths, they will migrate to different positions 
on the gel. Thus, in this manner, the nucleotide content of each ohgonucleotide primer 
can be identified by the position of the ohgonucleotide primer on the gel. 

The polynucleotide of interest is hybridized to the array of oligonucleotide 
primers, or to the free nucleotide primers, under high stringency conditions, so that an 
%i 1 5 exact match between the polynucleotide of interest and the ohgonucleotide primers is 
"" J obtained, without any base-pair mismatches (see Sambrook et al., Molecular Cloning: 

p A Laboratory Manual (2nd Ed, 1989)). For example, a schematic illustration of a 

2 hypothetical polynucleotide of interest annealed to an oligonucleotide primer that is 

attached to a sohd support is shown schematically in Figure 2. In Figure 2, a part of the 
2 0 sequence of the polynucleotide of interest that follows immediately after the portion of 
^ the polynucleotide that is bound to the oligonucleotide primer on the array is shown as 

ij TGCAACTA. Six corresponding consecutive primers are shown in Figure 3, i.e. 

primers ending with the pairing bases A, AC, ACG, etc. If the polynucleotide of 
interest is double-stranded, it can be separated into two single strands either before or 

2 5 after the binding of the polynucleotide of interest to the array oligonucleotide primers. 

Both the template and the complementary polynucleotide of interest can be analyzed 
utilizing a single array. Thus, while not shown in Figure 2, appropriate primers 
corresponding to the complementary polynucleotide of interest are also attached to the 
solid support in known positions. 

3 0 When the polynucleotide of interest is hybridized to the array of sets of 

oligonucleotide primers, or to the free oligonucleotide primers, under hybridization 
conditions, annealed primers are formed. The term, "annealed primer", as used herein, 
refers to an ohgonucleotide primer (either free or attached to a solid support) to which a 
polynucleotide of interest is hybridized. The annealed primers are subjected to a single 
3 5 base extension reaction. The "single base extension reaction", as used herein, refers to a 
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reaction in which the annealed primers are provided with a reaction mixture comprising 
a DNA polymerase, such as T7 polymerase, and terminating nucleotides under 
conditions such that single terminating nucleotides are added to each of the annealed 
primers. The term "terminating nucleotides", as used herein, refers to either single 
5 terminating nucleotides, or units of nucleotides, the units preferably being dinucleotides. 
hi a preferred embodiment, the terminating nucleotides are single dideoxynucleotides. 
The terminating nucleotides can comprise standard nucleotides, and/or nucleotide 
analogues. The terminating nucleotide added to each annealed primer is thus a base 
pairing with the template base on the polynucleotide of interest, and is added 

1 0 immediately adjacent to the growing end of the respective primer. An oligonucleotide 
primer to which a terminating nucleotide has been added through the single base 
extension reaction is termed an "extended primer". Thus, as schematically shown for 
both strands of the hypothetical polynucleotide of interest in Figure 4, a single 
nucleotide is added to each primer in the array; the primer set related to the strand 

1 5 illustrated in Figure 2 is shown to the left in Figure 4, and the other (complementary) 
strand is shown to the right. The nucleotides added are shown in extra bold type. 

The terminating nucleotides preferably comprise dNTPs, and particularly 
comprise dideoxynucleotides (ddNTPs), but other terminatmg nucleotides apparent to 
the skilled person can also be used. If the terminating nucleotides are single 

2 0 nucleotides, then nucleotides corresponding to each of the four bases (A, T, G and C) 
are utilized in the single base extension reaction. If the terminating nucleotides are 
dinucleotide units, for example, then nucleotides corresponding to each of the sixteen 
possible dinucleotides are utilized. 

The nucleotides are mutually distinguishable. For example, if the solid support 

2 5 is coated with a free electron metal, such as with gold or silver, surface plasmon 

resonance (SPR) microscopy allows identification of each nucleotide, by the change of 
the refractive index at the surface caused by each base extension. Alternatively, at least 
one of the terminating nucleotides is labelled by standard methods to facilitate detection. 
Suitable labels include fluorescent dyes, chemilxmiinescence, and radionuclides. The 

3 0 number of nucleotides that are labelled can be varied. It is sufficient to use three 

labelled terminating nucleotides, the fourth terminating nucleotide being identified by 
its "non-label", if single nucleotides are added in the base extension reaction. For 
example, if one is examining the polynucleotide of interest for the presence of a 
particular alteration, and not for the complete base sequence in the ahered region, three 
3 5 labelled terminating nucleotides are sufficient. Fewer than three labels can also be 
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utilized under appropriate circumstances. An exemplification of the use of two labelled 
and two unlabelled dNTPs is described below. If a specific alteration is to be 
investigated, such as a point mutation, only the native or normal nucleotide need be 
labelled, as a mutation would be indicated by the presence of the "non-label". 
5 Altematively, the expected mutant nucleotide can also be labelled. 

After the single base extension reaction has been performed, the identity and 
location of each terminating nucleotide is observed. If fi:ee oligonucleotide primers are 
used, the extended primers are eluted and separated by gel electrophoresis, and the gel is 
then analyzed. If oligonucleotide primers attached to an array are used, the array itself is 

1 0 analyzed. The gel or array is analyzed by detecting the labelled, terminating nucleotides 
bound to the oligonucleotide primers. The labeled, terminating nucleotides are detected 
by conventional methods, such as by an optical system. For example, a laser excitation 
source can be used in conjunction with a filter set to isolate the fluorescence emission of 
a particular type of terminating nucleotide. Either a photomultiplier tube, a charged- 

15 coupled device (CCD), or another suitable fluorescence detection metnod can be used to 
detect the emitted light fi-om fluorescent terminating nucleotides. 

The sequence of the polynucleotide of interest can be analyzed from the label 
pattern observed on the array or on the gel, since the position of each different primer on 
the array or on the gel is known, and since the identity of each terminating nucleotide 

2 0 can be determined by its specific label. The label and position of each terminating 
nucleotide either within the array or on the gel will directly define the sequence of the 
polynucleotide of interest that is being analyzed. Mutations or alterations in the 
sequence of the polynucleotide of interest are indicated by alterations in the expected 
label pattern. For example, assume that the nucleotide sequence shown in Figure 2 

2 5 contains a mutation: the third base C fi-om the left is replaced by a G in the 

polynucleotide of interest. The top primer in Figure 3 will still be extended by a C as 
shown in Figure 4, whereas the next primer will be extended by a C rather than a G. 
Since this new, unexpected base C can be identified by its specific label and the 
respective primer location is known, the corresponding base mutation is identified as G. 

3 0 The following simple example illustrates the abihty to obtain complete sequence 

information and to identify a mutation in a representative polynucleotide of interest. 
The example utilizes two labelled terminating nucleotides, which give complete 
sequence information. 



Assume a normal polynucleotide of the following base pair composition: 
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+ACTGCTTAG 
-TGACGAATC 

and a corresponding mutant polynucleotide having the following base pair 

composition, which has a single base mutation in the third base pair: 

5 +ACCGCTTAG 
-TGGCGAATC. 

Using fluorescent labeUing, for example, with a red label ("R") for terminating A 

and a green label ("G") for terminating G, and no label, i.e., null ("N") for the remaining 

bases T and C, the following "binary" codes allowing sequence interpretation would be 

1 0 obtained for the normal, mutant and heterozygote sequences, respectively: 

H- N-G-R-N-G-R-R-N-N Normal 
_ R-N-N-G-N-N-N-R-G 

+ N-G-G-N-G-R-R-N-N Mutant (Affected) 

- R-N-N-G-N-N-N-R-G 

15 R 

+ N-G-G-N-G-R-R-N-N Heterozygote (Carrier) 

- R-N-N-G-N-N-N-R-G. 

The presence of such a point mutation will affect the base pairing of the next few 
oligonucleotide primers to the polynucleotide of interest, and thereby the primer 
2 0 extensions obtained, such that the bases in the vicinity of the mutation (i.e., in the 

altered region) may not be accurately identified. To optimize identification of bases in 
the altered region, it is preferred to analyze both strands of such a double-stranded 
polynucleotide of interest. The few bases that may be difficult to identify on the 
template polynucleotide of interest, as well as the changed base, will be identified by the 

2 5 base extensions of the primers for the complementary polynucleotide of interest, as the 

analysis of the complementary polynucleotide of interest approaches the mutation site 
from the opposite direction. In the nearest regions on either side of the alteration, the 
sequence determination is thereby provided by the oligonucleotide primers for one of 
the two strands, 

3 0 The sequence of a polynucleotide of interest for which the sequence is 

previously known can be determined using methods similar to those described above in 
reference to identification of mutations utihzing an array of oUgonucleotide primers. As 
before, the positions of the terminating nucleotides within the array will directly define 
the sequence position of each nucleotide in the polynucleotide of interest. 
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To determine the oligonucleotide sequence, one annealed primer is selected to 
be the "starting" annealed primer; it is supposed for purposes of analysis that the 
sequence of the polynucleotide of interest "starts" with this primer. The nucleotide 
which has been added to the starting annealed primer is detected using standard 
5 methods. Then, a second annealed primer which has the same nucleotide sequence as 
the starting annealed primer, minus the 5' nucleotide and with the addition of the added 
nucleotide, is then selected. The terminating nucleotide which has been added to the 
second annealed primer is detected. These steps are then repeated, using the second 
annealed primer as the "starting" annealed primer in each repetition, until the sequence 

10 of the polynucleotide of interest is determined. For example, if the ohgonucleotide 
primers are 10 nucleotides in length (N = 10), the starting annealed primer is chosen to 
correspond to the first ten bases of the sequence. The terminating nucleotide of the 
starting annealed primer is then determined. Next, bases 2-1 1 (i.e., bases 2-10 of the 
starting annealed primer plus the terminating nucleotide extension) are matched to 

1 5 another annealed primer. This primer is the second annealed primer. The terminating 
nucleotide of the second annealed primer is then determined. These steps are repeated 
to determine the complete sequence. In this manner, the single base extension reaction 
automatically links together the set of annealed primers. 

After analysis of the nucleotide sequence of a polypeptide of interest, the 

2 0 polynucleotide of interest and the terminating nucleotides can be removed from the 
DNA chip, so that the chip can be reused. In a preferred embodiment, the added 
terminating nucleotides are capable of being removed from the solid support after 
analysis of the polynucleotide of interest has been completed. Once the nucleotides are 
removed, the solid support with the immobihzed oligonucleotide primers can be used 

2 5 for a new analysis. The nucleotides can be removed using standard methods, such as 

enzymatic cleavage or chemical degradation. Enzymatic cleavage, for example, would 
use a terminating nucleotide which can be removed by an enzyme. The single base 
extension reaction could result in addition to the ohgonucleotide primers of RNA 
dideoxyTTP or RNA dideoxyCTP by reverse transcriptase or other polymerase. A C/T 

3 0 cleavage enzyme, such as RNase A, can then be used to "strip" off the RNA 

dideoxynucleotides. Alternatively, sulfiir-containing dideoxy-A or dideoxy-G can be 
used during the single extension reaction; a sulfur-specific esterase, which does not 
cleave phosphates can then be used to cleave off the dideoxynucleotides. For chemical 
degradation, a chemically degradable terminating nucleotide can be used. For example, 
3 5 a modified ribonucleotide having its 2'- and 3'-hydroxyl groups esterified, such as by 
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acetyl groups, can be used. After binding of the terminating nucleotide to the annealed 
primer, the acetyl groups are removed by treatment with a base to expose the T- and 3 
hydroxyl groups. The ribose residue can then be degraded by periodate oxidation, and 
the residual phosphate group removed from the annealed primer by treatment with a 
5 base and alkaline phosphatase. 

The method and apparatus of the current invention have uses in detecting 
mutations, deletions, expanded ohgonucleotide repeats, and other genetic abnormalities. 
For example, the current invention can be used to identify frame shifting mutations 
caused by insertions or deletions. Furthermore, carrier status of heritable diseases, such 

10 as cystic fibrosis, p-thalassemia, a-1, Gaucher's disease, Tay Sach's disease, or Lesch- 
Nyham syndrome, can be easily determined using the current invention, because both 
the normal and the altered signals would be detected. Furthermore, mixtures of DNA 
molecules such as occur in HIV infected patients with drug resistance can be 
determined. The HIV virus may develop resistance against drugs like AZT by point 

15 mutations in the nucleotide sequence of a reverse transcriptase (RT) gene. When 
mutated viruses start to appear in the virus population, both the mutated gene and the 
normal (wild type) gene can be detected. The greater the proportion is of the mutant, 
the greater is the signal from the corresponding mutant terminating nucleotide. 

The current invention is fiirther exemplified by the following Examples. 

2 0 EXAMPLE 1 Analyzing the Sequence of a Polynucleotide of Mterest Utilizing Free 
Ohgonucleotide Primers 
An analysis of the hypoxanthine-guanine phosphoribosyl-transferase (HPRT) 
gene (the polypeptide of interest) was conducted for three individuals (Patients A, B, 
and C). 

2 5 A. Obtaining the Polynucleotide of Literest 

The polymerase chain reaction (see Sambrook et al., Molecular Cloning: A 
Laboratory Manual (2nd Ed, 1989), especially chapter 14) was utiUzed to ampUfy the 
polynucleotide of interest. During the reaction, one of the two PGR primers was tagged 
with a biotin group. Following ampUfication, the single strand template was captured 

3 0 with streptavidin coated magnetic beads. For a 50 |il PGR reaction, 25 iil of Dynal M- 

280 paramagnetic beads (Dynal A/S, Oslo, Norway) was used. The supematant of the 
beads was removed and replaced with 50 |al of a binding and washing buffer (10 mM 
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Tris-HCl (pH 7.5); 1 mM EDTA; 2 M NaCl). The PGR product was added to the beads 
and incubated at room temperature for 30 minutes for bead capture of the products. The 
single stranded polynucleotide of interest was isolated by the addition of 150 fxl of 0.15 
M NaOH for 5 minutes. The beads were captured, the supernatant was removed, and 
5 150^1 of 

0.15 M NaOH was again added for five minutes. Following denaturation, the beads 
were washed once with 150 |il of 0.15 M NaOH and twice with IX T7 annealing buffer 
(40 mM Tris-HCl (pH 7.5); 20 mM MgCl2; 50 mM NaCl). The beads were finally 
suspended in 70 jil of water. This process both isolates the single-stranded 
1 0 polynucleotide of interest and removes any unincorporated dNTPs remaining after PGR. 

B. Analyzing the Sequence of the Polynucleotide of hiterest 
After single strand isolation, the oligonucleotide primers were annealed to the 
polynucleotide of interest by heating to 65°C for approximately two minutes and 
cooling to room temperature over approximately 20 minutes. The 10 [il reaction 

1 5 volume consisted of 7 [il of the polynucleotide of interest (0.5-1 pmol), 2 \il of 5X T7 
annealing buffer, and 1 [xl of extension primer (3-9 pmol). The extension reaction was 
then performed. For the reaction, 1 \i\ of DTT, 2 |il of T7 polymerase (diluted 1 : 8) and 
1 ^,1 of ddNTPs (final concentration of 0,5 uM) were added. The reaction proceeded at 
3TC for two minutes, and then was stopped by the addition of 100 |J,1 of washing buffer 

2 0 (IXSSPE, 0.1% SDS, 30% ethanol). The beads were washed twice with 150 |il of the 
washing buffer. The extension products were eluted by the addition of 5 |il of 
formamide andheated to 70°G for two minutes. The beads were captured by the magnet 
and the supematant containing the extension products was collected and analyzed on a 
ABI 373 (Applied Biosystems, hic). OUgonucleotide primers of lengths varying from 

2 5 10 to 17 were used. As shown in Figure 5, extension products were formed efficiently. 

1 . Deoxynucleotide Labelling - Four Fluorophores 
Each ddNTP was labelled by a different fluorophore, ABI Dye Terminator dyes 
designed for taq polymerase were used: ddG is blue, ddA is green, ddT is yellow, and 
ddG is red. Four fluorescent ddNTPs were added to each reaction tube. The extension 

3 0 products were purified, gel separated, and analyzed on an ABI 373. Two different bases 

of exon 3 of the HPRT gene were analyzed: base 16534 (wild type is A) and base 
16620 (wild type is G). 
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The results of the four fluor, single lane, indicated that the presence of mutations 
could be identified easily. All three patients are wild type for A at base 16534 (data not 
shown). Electrophoretograms shown in Figures 6A, 6B and 6C indicate that Patient A 
is wild type (C) at base 16620 (Figure 6 A), patient B is a mutated individual (C">T) at 
5 base 16620 (Figure 6B), and patient C is a carrier at base 16620 (both C and T) (Figure 
6C). 

2. Deoxynucleotide Labelling - Single Fluorophore 

Each ddNTP was labelled by the same fluorophore. DuPont NEN fluorescein 
dyes (NEL 400-404) were used. Each ddNTP appears blue in the ABI 373. Only one 

1 0 fluorescent ddNTP is added to each reaction tube. The extension products were 

purified, gel separated, and analyzed on an ABI 373. Four lanes on the gel must be used 
to analyze each base. Two different bases of exon 3 of the HPRT gene were analyzed: 
base 16534 (wild type is A) and base 16620 (wild type is C). 

The resuhs of the single fluor, four lane, demonstrated results that were identical 

15 to those obtained using the four fluor, single lane method described in (1), above. This 
type of assay minimizes the effect of the fluorophore differences during extension 
product formation and gel separation, 

3 . Deoxynucleotide Labelling - Biotinlyated Dideoxynucleotides 
The ddNTPs are labelled with a biotin group. Four separate reactions are 

2 0 performed, whereby only one of the four ddNTPs is biotinylated. Following the 

extension reaction, a strepavidin (or avidin) coupled fluorescent group is attached to the 
biotinylated ddNTPs. Because the biotin group is small, uniform incorporation of the 
ddNTPs is expected and base-specific differences in extension are minimized. 
Furthermore, the fluorescent signal can be ampUfied because the biotin group can bind a 

2 5 steptavidin moiety coupled to multiple fluors. 

EXAMPLE 2 Analyzing the Sequence of a Polynucleotide of Interest Utilizing 
Labelled Deoxynucleotides 
An analysis of the hypoxanthine-guanine phosphoribosyl-transferase (HPRT) 
gene (the nucleotide sequence of a polypeptide of interest) was conducted for three 

3 0 individuals (Patients A, B, and C). The third exon of the HPRT gene was examined. 

Microscope glass slides were epoxysilanated at 80°C for eight hours using 25% 
3' glycidoxy propyltriethoxysilane (Aldrich Chemical) in dry xylene (Aldrich Chemical) 
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with a catalytic amount of diisopropylehylamine (Aldrich Chemical), according to 
Southern (Nucl. Acids Res. 20:1679 (1992), and Genomics 13:1008 (1992)). The DNA 
chips were made by placing 0.5 |il drops of 5*-amino-linked ohgonucleotides (50 |j,M, 
0.1 M NaOH) at 37°C for six hours in a humid environment. The chips were washed in 
5 50°C water for 15 minutes, dried and used. The annealing reaction consisted of adding 
2.2 |al of single-stranded DNA (0.1 [iM in T7 reaction buffer) to each grid position, 
heating the chip in a humid environment to 70°C and then cooling slowly to room 
temperature. A 1 \xl drop of 0.1 M DTT, 3 units of Sequenase Version 2.0 (USB), 5 ^Ci 
a-^^P dNTP (3000 Ci/mmol) (DuPont NEN) and noncompeting unlabeled 18.5 |iM 
1 0 ddNTPs (Pharmacia) were added to each grid position for three minutes. The reaction 
was stopped by washing in 75°C water, and analyzed on a Phosphorlmager (Molecular 
Dynamics). 

Figures 7A, 7B and 7C depict the results of a DNA chip-based analysis for a 
five-base region within the third exon of the HPRT gene. The rows correspond to a 
1 5 particular base under investigation, and the coluitms correspond to the labeled base. 

Figure 7A demonstrates the wild type sequence (TCGAG), Figure 7B demonstrates a C- 
->T mutation, and Figure 7C demonstrates a C~>T mutation. 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
2 0 described specifically herein. Such equivalents are intended to be encompassed in the 
scope of the following claims. 
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CLAIMS 

What is claimed is: 

A method of analyzing the sequence of a polynucleotide of interest, comprising 
the steps of: 

a) annealing a polynucleotide of interest to free oligonucleotide primers 
having known sequences of N nucleotides in length to generate annealed 
primers; 

b) subjecting the annealed primers to a single base extension reaction to 
extend the annealed primers by the addition of a terminating nucleotide; 

c) observing the identity of each terminating nucleotide that has been added 
to the annealed primers. 

A method of analyzing the sequence of a polynucleotide of interest, comprising 
the steps of: 

a) annealing a polynucleotide of interest to oligonucleotide primers having 
known sequences of N nucleotides in length under hybridization 
conditions, to generate annealed primers; 

b) subjecting the annealed primers to a single base extension reaction which 
comprises providing to the annealed primers nucleotides corresponding 
to each of the four bases, to extend the annealed primers by the addition 
of a terminating nucleotide; 

c) observing the identity and location of each terminating nucleotide that 
has been added to the annealed primers. 

A method of analyzing the sequence of a polynucleotide of interest, comprising 
the steps of: 

a) attaching an array of ohgonucleotide primers having known sequences of 
N nucleotides in length to a solid support at known locations; 

b) annealing the polynucleotide of interest to the array of ohgonucleotide 
primers to generate annealed primers; 

c) subjecting the annealed primers to a single base extension reaction to 
extend the annealed primers by the addition of a terminating nucleotide; 
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d) observing the identity and location of each terminating nucleotide within 
the array on the sohd support. 

A method of analyzing the sequence of a polynucleotide of interest, comprising 
the steps of: 

a) attaching an array of oligonucleotide primers having known sequences of 
N nucleotides in length to a sohd support at known locations; 

b) annealing the polynucleotide of interest to the array of ohgonucleotide 
primers to generate annealed primers; 

c) subjecting the annealed primers to a single base extension reaction to 
extend the annealed primers by the addition of a terminating nucleotide; 

d) selecting a starting annealed primer; 

e) observing the identity and location of the terminating nucleotide which 
has been added to the starting annealed primer^ to determine the next 
nucleotide in sequence; 

f) selecting a second annealed primer which has the same nucleotide 
sequence as nucleotides 2 through N of the starting annealed primer 
nucleotide plus the next nucleotide in sequence as determined in step (e), 
and 

g) repeating steps (e) and (f), using the second annealed primer as the 
starting annealed primer for each repetition, to determine the sequence of 
the polynucleotide of interest. 

A method of analyzing the sequence of a polynucleotide of interest, comprising 
the steps of: 

a) attaching an array of oligonucleotide primers, having known sequences 
of N nucleotides in length to a sohd support at defined locations; 

b) annealing the polynucleotide of interest to the array of ohgonucleotide 
primers under hybridization conditions, to generate annealed primers; 

c) subjecting the annealed primers to a single base extension reaction which 
comprises providing to the annealed primers nucleotides corresponding 
to each of the four bases, to extend the annealed primers by the addition 
of a terminating nucleotide; 

d) observing the identity and location of each terminating nucleotide within 
the array on the solid support. 



-19- 

6. A method of analyzing the sequence of a polynucleotide of interest, comprising 
the steps of: 

a) attaching an array of ohgonucleotide primers, having known sequences 
of N nucleotides in length to a solid support at defined locations; 

b) annealing the polynucleotide of interest to the array of oligonucleotide 
primers under hybridization conditions, to generate armealed primers; 

c) subjecting the annealed primers to a single base extension reaction which 
comprises providing to the annealed primers nucleotides corresponding 
to each of the four bases, to extend the annealed primers by the addition 
of a terminating nucleotide; 

d) selecting a starting annealed primer; 

e) observing the identity and location of the terminating nucleotide which 
has been added to the starting annealed primer, to determine the next 
nucleotide in sequence; 

f) selecting a second annealed primer which has the same nucleotide 
sequence as nucleotides 2 through N of the starting annealed primer 
nucleotide plus the next nucleotide in sequence as determined in step (e), 
and 

g) repeating steps (e) and (f), using the second armealed primer as the 
starting annealed primer for each repetition, to determine the sequence of 
the polynucleotide of interest. 

7. The method of any one of Claims 1 to 6, wherein the single base extension 
reaction comprises subjecting the annealed primers to a reaction mixture 
comprising a polymerase and nucleotides corresponding to each of the four 
bases. 

8. The method of any one of Claims 5 to 7, wherein the nucleotides corresponding 
to each of the four bases are mutually distinguishable. 

9. The method of Claim 8, wherein three of the four nucleotides are differently 
labelled. 



1 0. The method of Claim 9, wherein the three differently labelled nucleotides are 
fluorescently labelled. 
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1 1 . The method of any one of Claims 1 to 1 0, further comprising analyzing the 
sequence of the complementary polynucleotide of interest. 

12. The method of any one of Claims 1 to 1 1, wherein the terminating nucleotides 
are dideoxynucleotides. 

1 3 . The method of any one of Claims 1 to 1 2, wherein the length N of the 
oligonucleotide primers is between 7 and 30 inclusive, 

14. The method of any one of Claims 1 to 13, wherein the length N of the 
oligonucleotide primers is between 20 and 24 inclusive. 

15. The method of any one of Claims 1, 2, 13 or 14, wherein the oligonucleotide 
primers comprise ohgonucleotide primers of different lengths, 

16. The method of any one of Claims 1 to 15, wherein observing the identity and 
location of a terminating nucleotide comprises the use of a charge coupled 
device or a photomultiplier tube. 

17. The method of any one of Claims 3 to 14 or 16, wherein the terminating 
nucleotides are removed from the annealed primers after completed analysis to 
prepare the solid support for reuse. 

18. The method of any one of Claims 1 to 17, wherein the terminating nucleotides 
are dinucleotides. 

19. An apparatus for analyzing the sequence of a polynucleotide of interest, 
comprising a solid support having attached thereon at defined locations an array 
of oligonucleotide primers having known sequences. 

20. The apparatus of Claim 19, wherein the oligonucleotide primers are attached to 
the solid support by a specific binding pair. 

21 . The apparatus of Claim 20, wherein the specific binding pair is biotin and a 
molecule selected from the group consisting of: avidin and strepavidin. 
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PARALLEL PRIMER EXTENSION APPROACH 
TO NUCLEIC ACK) SEQUENCE ANALYSIS 

ABSTRACT OF THE DISCLOSURE 

A method of analyzing a polynucleotide of interest, comprising providing one or 
more sets of consecutive oligonucleotide primers differing within each set by one base 
at the growing end therof; annealing a single strand of the polynucleotide or a jfragment 
of the polynucleotide to the oligonucleotide primers under hybridization conditions; 
subjecting the primers to single base extension reactions with a polymerase and 
terminating nucleotides, the terminating nucleotides being mutually distinguishable; and 
observing the location and identity of each terminating nucleotide to thereby analyze the 
sequence or a part of the nucleotide sequence of the polynucleotide of interest, is 
disclosed. An apparatus comprising a sohd support to which is attached at defined 
locations thereon one or more sets of consecutive oligonucleotide primers differing 
within each set by one base at the growing end thereof is also described. 
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attorney (s) and/or agent (s) to file and prosecute an international application 
under the Patent Cooperation Treaty based upon the above-identified 
application, including a power to meet all designated office requirements for 
designated states: 

I David E. Brook Reg. No. 22,592 

t James M. Smith Reg. No. 28,043 

Leo R. Reynolds Reg. No. 20,884 

Patricia Granahan Reg. No. 32,227 

Mary Lou Wakimura Reg. No. 31,804 

'Jail of Hamilton, Broo)c, Smith and Reynolds, P.C., Two Militia Drive, 
•f Lexington, Massachusetts 02173; 

and 

Send correspondence to; Leo R. Reynolds, Esq. 

Hamilton, Brook, Smith & Reynolds, PIC, 

Two Militia Driv^, UxxngtoTit M 02173 

Direct telephone calls to: Leo R> Reynolds, Esq. 

617-861-6240 Fax: 617-861-9540 



I hereby declare that all statements made herein of roy own knowledge are 
true and that all statements made on information and belief are believed to be 
true; and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Thomas O. Hoover 
Alice Carroll 
N. Scott Pierce 
Richard A, Wise 
Elizabeth W. Mata 



Reg. No. 32,470 
Reg. No. 33,542 
Reg. No. 34,900 
Reg. No. 18,041 
Reg. No. 38,236 



Full name of sole 
or first inventor 

Inventor ' s 
Signature 

Residence 



C. Thppgbs Caskey 




Dat e , 2i<Q 



Houston, Texas 77005 



Citizenship US 

Post Office Address same 
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Full name of second joint 

inventor, if any John Shumaker 

Second Inventor's 

Signature Date 

Residence 719 Pizer 

Houston, Texas 77009 

Citizenship 

Post Office Address same 



Full name of third joint 

inventor, if any Andres Metspalu 

Third Inventor's 

Signature Date^ 

Residence^ 4-33 Kaunase Street 

J Tartu 2400 Estonia 



^^Jcitizenship Estonia 

;'Post Office Address same 



|1fu11 name of fourth joint 

inventor, if any ^ [ 

"^Fourth Inventor's 

7 Signature 

i^esidence 



rlCitizenship^ 



Post Office Address 



Full name of fifth joint 

inventor, if any 

Fifth Inventor's 

S ignature _p a-te_ 

Residence - 



Citizenship 

Post Office Address 
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>%J>^y IN ^HE UNITED STATES PATENT AND TRADEMARK OFFICE 



Declaration for Patent Application 



As a named inventor I hereby declare that: 

My residence/ post office address and citizenship are as stated next to 

my name; 

I believe X am the original, first and sole inventor (if only one name 
is listed) or an original, first and joint inventor (if plural names are 
listed in the signatory page(s) commencing at page 3 hereof) of the subject 
matter which is claimed and for which a patent is sought on the invention 
entitled 

Parallel Primer Extension Approach to Nucleic Acid 

Sequence Analysis 

the specification of which (check one) 



[ ] is attached hereto* 

[ X ] was filed on 22 June 1994 as United States Application 

Number or PCX International Application Serial No. PCT/gS94/07086 
(U.S.S,N. 08/564,100) 

and was amended on (if applicable) • 

I hereby state that I have reviewed and understand the contents of the 
above-identified specification, including the claims, as amended by any 
- amendnient referred to above* 

I acknowledge the duty to disclose information which is known by me to 
be material to patentability as defined in 37 C-F,R» §1*56. 



I hereby claim foreign priority benefits under Title 35, United States 
Code, §119 of any foreign application (s) for patent or inventor's certificate 
listed below and have also identified below any foreign application for patent 
or inventor's certificate having a filing date before that of the application 
on which priority is claimed: 





Prior Foreian Application's ) 


SE9302152-5 


Sweden 


22 June 1993 


(Number) 


(Country) 


(Day /Month/year filed) 


(Number) 


(Country) 


(Day/ Month/ Year filed) 



Priority 
Not 

Claimed 



(Number) (Country) (Day /Month/Year filed) 



I hereby claim the benefit under 35 U-S.C, S119(e;''of any United States 
provisional application(s) listed below. 



(Application Number) ' (Filing Date) 



(Application Nuinber) (Filing Date) 
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1 hereby claim the benefit under Title 35, United States Code, §120 of any 
United States application( s) listed below and, insofar as the subject matter 
of each of the claims of this application is not disclosed in the prior United 
States application in the manner provided by the first paragraph of Title 35, 
United States Code, §112, I acknowledge the duty to disclose information known 
by me to be material to patentability as defined in 37 CF.R. §1.56 which 
occurred between the filing date of the prior application and the national or 
PCT international filing date of this application: 



(Application Serial No#) (Filing date) (Status, patented/ pending, abandoned) 

(Application Serial No.) (Filing date) (Status, patented, pending, abandoned) 

As a named inventor, I hereby appoint the following attorney(s> and/or 
agent (s) to prosecute this application and transact all business in the Patent 
and Trademark Office connected therewith. 

I also hereby grant additional Powers of Attorney to the following 
attorney (s) and/or agent (s) to file and prosecute an international application 
under the Patent Cooperation Treaty based upon the above-identified 
application, including a power to meet all designated office requirements for 
designated states: 

Thomas O. Hoover Reg, No* 32,470 
Alice Carroll Reg. No. 33,542 
Scott Pierce Reg. No. 34,900 
Richard A* Wise Reg. No, 18,041 

Elizabeth' Mata Reg. No. 38,236 

gall of Hamilton, Brook, Smith and Reynolds, P.C., Two Militia Drive, 
Tiexington, Massachusetts 02173; 

01 and 

Send correspondence to? Leo R. Reynolds, Esq> 

\, Hamilton, Brook, Smith & Reynolds, PIC. 

Two Militia Drive. Leyinfton- MA 0717^ 

u\ Direct telephone calls to: Leo R. Reynolds, Esq. 

□ 617-861-6240 Fax: 617-861-9540 



I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information and belief are believed to be 
true; and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United statee 
Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon* 



Full name of sole 

or first inventor Thomas Caskey 

Inventor's 

signature Date 

Re s iden ce _^ 6402 Belmont 

Houston, Texas 77005 



David E* Brook 
James M. Smith 
Leo R. Reynolds 
Patricia Granahan 
Mary Lou Wakimura 



Reg» No, 22,592 

Reg. No* 28,043 

Reg. No. 20,884 

Reg. No. 32,227 

Reg. No. 31,804 



Citizenship 

post Office Address 



same 
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Full name of second joint 

inventor, if any John Shumaker 

Second Inventor's 

Signature 
Residence 719 Flzer 



Date ^ /^f(o 

^ 710 P-TfTfiY- 



Houston, Texas 77009 



Citizenship tJS 



post Office Address same 



Full name of third joint 
inventor, if any 

Third Inventor's 

S ignat u re 

Residence 



Andres Metspalu 

^Date^ 

4-33 Kaunase Street 



^ Tartu 2400 Estonia 

5@Ltizenship Estonia ; 

M>st Office Address same 



lE^ll name of fourth joint 

inventor, if any ^ ^_ „ 

pourth Inventor's 

\^ signature _^ Date^ 

kfesidence . 



j&tizenship 

ilPbst Office Address 



Full name of fifth joint 

inventor, if any 

Fifth Inventor's 

Signature P^te 

Residence ^ 



Citizenship 

Post Office Address 
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November 13, 2000 



PATENT APPLICATION 
Docket No.: 2875.1001-007 



Date: /y/a/^^ 
EXPRESS MAIL LABEL NO. G^I^SS9S~10^06i.US 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
Applicants: C. Thomas Caskey, John M. Shumaker, and Andres Metspalu 

Title: Parallel Primer Extension Approach to Nucleic Acid Sequence Analysis 

TRANSMITTAL OF SEQUENCE LISTING IN COMPUTER READABLE FORM 
IN COMPLIANCE WITH 37 C.F.R. §§1.82ire') AND (f) 

Box Sequence 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Transmitted herewith is a copy of the "Sequence Listing" in computer readable 
foiTO as required by 37 C.F.R. §1.821(e). As required by 37 C.F.R. §1.821(f), 
AppHcant's Attorney hereby states that the content of the "Sequence Listing" in paper 
form and of the computer readable form of the "Sequence Listing" are the same. 



Respectfully submitted, 

HAMILTON, BROOK, SMITH & REYNOLDS, P.C. 




Sandra A. Brockman 



Registration No. 44,045 
Telephone (781) 861-6240 
Facsimile (781) 861-9540 
Lexington, Massachusetts 02421-4799 

Date: /t^rve^ntjac 12,200 
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SEQUENCE LISTING 

<110> Caskey, C. Thomas 
S humake r , John 
Metspalu, Andres 

<120> PARALLEL PRIMER EXTENSION APPROACH TO 
NUCLEIC ACID SEQUENCE ANALYSIS 



<130> 2875.1001-007 

<150> US 08/564, 100 
<151> 1994-06-22 

<150> SE 9302152-5 
<151> 1993-06-22 

<160> 22 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> polynucleotide of interest 
<400> 1 

gatagcaatc gcttacggta atccggcctg 

<210> 2 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 2 
gatagcaatc 

<210> 3 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 3 
atagcaatcg 

<210> 4 
<211> 10 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Oligonucleotide primer 

<400> 4 
tagcaatcgc 

<210> 5 

<211> 10 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 5 
agcaatcgct 

<210> 6 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 6 
gcaatcgctt 

<210> 7 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 7 
caatcgctta 

<210> 8 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 8 
aatcgcttac 

<210> 9 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 



<400> 9 
atcgcttacg 

<210> 10 

<211> 10 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 10 
tcgcttacgg 

<210> 11 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 11 
cgcttacggt 

<210> 12 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 12 
gaatgccatt 

<210> 13 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 13 
aatgccatta 

<210> 14 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide primer 



<400> 14 
atgccattag 



<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 15 
tgccattagg 

<210> 16 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 16 
gccattaggc 

<210> 17 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 17 
ccattaggcc 

<210> 18 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 18 
cattaggccg 

<210> 19 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 19 
attaggccgg 

<210> 20 
<211> 10 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Oligonucleotide primer 

<400> 20 
ttaggccgga 

<210> 21 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 21 
taggccggac 

<210> 22 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> polynucleotide of interest 
<400> 22 

caggccggat taccgtaagc gattgctatc 



